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Processive Sugar Transferase 

The invention relates to the use of processive UDP-sugar: l,2-diacylglycerol-3-p-, UDP- 
sugar: 3-P-l,3'-phospho-5'«-glycerol-r,2'-diacyl-^rt-glycerol)- and UDP-sugar: 3-[0-p-D- 
glucopyrailosyl]-^w-glycerol-l ,3'-phospho-r,2'-diacyl-5w-glycerol-D-sugar transferases and 
similar proteins as well as the corresponding coding nucleic acids for the manipulation of the 
contents and/or the structure of glycosyldiacylglycerols and/or the synthetic secondary 
products thereof, as vs^ell as other substrates which are glycosylated by these enzymes, in 
transgenic cells and/or organisms. 

Glycosyldiacylglycerols were produced enzymatically by means of a sugar transferase 
(glycosyl transferase). For this purpose, the gene coding for a UDP-sugar transferase was 
isolated from genomic DNA of Bacillus subtilis and Staphylococcus aureus, and cloned into, 
and expressed in, E, coli. The activity of the enzymes was confirmed by means of specific in 
vzYro-enzyme assays. The products were also detected and identified in lipid extracts of 
transgenic E. coli cells. The products are various novel glycolipids having different number of 
glucose residues (maximum of 4) linked via a p(l— >6)glycosidic bond, and utilizing 
diacylgylcerol (DAG) or phosphatidylglycerol (PG) as the primary acceptor. 

In addition, these novel glycolipids comprise two differently structured novel 
phosphoglycolipids (PLl and PL2) with a different number of glucose residues (maximum of 
two) which in the case of (PL2) are also linked via a P(1^6)glycosidic bond and utilize 
phosphatidylglycerol as the acceptor (i.e. both diastereomers, i.e. with respect to the 
configuration of the non-acylated glycerol residue). The glycosyl residues may further be 
acylated in position 6'" of the terminal glucose: 

1 ) MGlcD: 3-[0-p-D-glucopyranosyl]-l ,2-diacylglycerol (Staphylococcus aureus ypfP) 

2) DGlcD: 3-[0-P-D-glucopyranosyl-(l— >6)-0-P-D-glucopyranosyl]-l,2-diacylglycerol 

3) TGlcD: 3-[0-p-D-glucopyranosyl-(l->6)-0-p-D-glucopyranosyl-(1^6)-0-p-D- 
glucopyranosyl]-! ,2-diacylglycerol 
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4)TeGlcD: 3-[0-p-D-glucopyranosyl-(1^6)-0-p-D-glucopyranosyl-(l->6)-0-p-D- 
glucopyranosyl-(l— >6)-0-P-D-glucopyranosyl]-l,2-diacylglycerol 



5) Phospholipid 1: 3-[0-p-D-glucopyranosyl]-5n-glycerol-l,3'-phospho-r,2'-diacyl-5'w- 
glycerol) 

6) Phospholipid 2:{3-[0-(6"*-0-acyl)-p-D-glucopyranosyl-(r"->6")-0-p-D-glucopyranosyl]- 2- 
acyl-^«-glycerol- 1 ,3'-phospho- 1 ',2'-diacyl-^w-glycerol} 

Note: The numbering of the glycerol residues I (Gro') and II (Gro") corresponds herein to the 
numbering 1-3 and V-3\ respectively, i.e. Gro^ is "left-hand" and Gro" is "right-hand" in 
accordance with Figure 13, 

Surprisingly, the enzymes act in a processive manner, i.e. all detected novel glycolipids are 
formed by successive addition of UDP-glucose to the respective preceding product of the 
enzymes. Further, alkyl-P-D-glucosides, ceramides (both enzymes), sterols and sterol 
glucosides (only the enzyme of S. aureus) are used as acceptors for a further glucosylation 
reaction. 



Glyceroglycolipids represent a group of membrane components which are very hetero- 
geneous with respect to their structure. They are found in bacteria (Kates, 1 990), plants and in 
very low amounts also in animals. Many structures especially of bacterial glycolipids have 
already been described many years ago (Kates, 1999), however, none of the genes 
synthesizing these glycolipids have been cloned, so that these substances can be obtained 
from the corresponding organisms only in analytical amounts. Only at the beginning of 1997 
was the first pubHcation issued, wherein the cloning and expression of a plant galactose : 1 ,2- 
diacylglycerol galactosyl transferase is described (Shimojina et al., 1997). However, this 
enzyme is no "processive" glycosyl transferase. 




Database searches in the "U.S. Patent Database" revealed that two further patents relating to 
glycosyl transferases exist: Patent No. 5 545 554: Glycosyl transferases for biosynthesis of 
oligosaccharides, and genes and encoding them, and Patent No. 5 641 668: Proteins having 
glycosyl transferase activity. It appears that the first-mentioned patent only relates to glycosyl 
transferases which synthesize oligosaccharide, so that this patent is not relevant with respect 
to the enzyme, viz a lipid glycosyl transferase, described herein. The second-mentioned patent 
relates to glycosyl transferases in general, in view of which the processive enzyme described 
in this specification is novel. 



Glycosyl diacylglycerols are naturally occurring compounds found in plants, animals and 
bacteria. However, an inexpensive, large-scale production of these compounds was not 
possible so far, since corresponding genes were not yet cloned. Glycosyl diacylglycerols can 
be used in a variety of applications, depending on the number of sugar residues and the 
structure of the fatty acids. 

When esterified with usual CI 8 unsaturated fatty acids, diglucosyl diacylglycerols have 
emulsifier properties which are useful in food industrial applications (in mayonnaise, 
margarine, ice cream, confectionery etc.). 

In the presence of highly unsaturated fatty acids, glycolipids may be introduced into 
polymers, which then obtain new characteristics and surfaces. Finally, glycosyl 
diacylgylcerols may obtain detergent characteristics, when the fatty acid chain length is 
drastically shortened. This would already now be possible in transgenic rape seed with 
predominant lauric acid. Such detergents could be produced in large amounts in an 
inexpensive manner, and such detergents would be biologically degradable. 

The phospholipids which are glucosylated by the enzyme of S, aureus receive new physico- 
chemical characteristics due to the charge of the phosphate residue between the two glycerol 
residues on the one hand, and on the other hand due to acylation of the sugar residue(s). Thus, 
by use of the described processive sugar transferases, not only neutral lipids, but also charged 
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glycolipids can be specifically produced and varied. Thus, a further class of charged glyco- 
phospholipids are developed via the sugar transferases. 

In the production of plant oils from oil seeds, a lecithin fraction is obtained, wherein phospho- 
lipids and glycolipids are accumulated. By over-expressing the genes disclosed in this 
specification in these plants, a variety of glycolipids (glucosyl diacylglycerols, steryl 
glucoside, glucocerebroside and other lipids described herein) could be concentrated, with a 
favourable effect on the baking properties of bakery products, to which the lecithin fraction is 
added. 

In addition, the phospholipids glucosylated by the S. aureus enzyme receive further physico- 
chemical properties due to the charge of the phosphate residue. 

This invention, therefore, relates to a process for the production of glycolipids in transgenic 

cells and/or organisms, comprising the following steps: 

transfer of a nucleic acid molecule that codes for a protein having the biological 
activity of a processive diacylglycerol glycosyltransferase to the cells or organism, 
expression of the protein having a biological activity of a processive diacylglycerol 
glycosyltransferase under suitable regulatory sequences in the cells or the organism, 
and 

if desired, recovery of the glycolipids synthesized by the biological activity of a 
processive diacylglycerol glycosyltransferase from the cells or the organism. 

In a preferred embodiment of the invention, the nucleic acid molecule codes for a protein 
having the biological activity of a processive diacylglycerol glycosyltransferase from Bacillus 
subtilis or Staphylococcus aureus. 

The transgenic cells may be any cells that are useful for the production of the new glycolipids, 
preferably the cells are plant, yeast or bacteria cells. The transformed organism is preferably a 
plant, yeast or bacterium. 

As mentioned above and as will be clear from the following description, the glycolipids 
produces by the process of the invention are preferably glycosyl diacylglycerols and/or 



phosphoglycolipids. More preferably, the glycolipids are monoglycosyldiacylglycerol, 
diglycosyldiacylglycerol, triglycosyl diacylglycerol, tetraglycosyldiacylglycerol, glycosyl 
ceramide, diglycosyl ceramide, steryl glycoside, steryl diglycoside, glycosyl phosphatidyl- 
glycerol, and/or diglycosyl phosphatidylglycerol. Most preferably, the glycolipids are 
monoglucosyldiacylglycerol, diglucosyldiacylglycerol, triglucosyldiacylglycerol, tetra- 
glucosyldiacylglycerol, glucosyl ceramide, diglucosyl ceramide, steryl glucoside, steryl 
diglucoside, glucosyl phosphatidylglycerol, and/or diglucosylphosphatidylglycerol. 

The invention is also directed to the use of a nucleic acid molecule coding for a protein 
having the biological activity of a processive diacylglycerol glycosyltransferase or of a 
proteins having the biological activity of a processive diacylglycerol glycosyltransferase for 
processive glycosylation, in particular for production of glycolipids. Processive glycosylation, 
in particular the production of glycolipids, may take place in vivo or in vitro. 

Further, the invention is directed to tetraglucosyldiacylglycerol, synthesized and described 
herein for the first time. The same applies to glucosylphosphatidylglycerol and 
diglucosylphosphatidylglycerol. 

The invention is further directed to the use of the glycolipids produced by processive 
glycosylation according to the invention in the food industry, as an emulsifier or as a 
detergent. 

A processive glycosyl transferase, as described herein, catalyzes the successive transfer of 
one or more hexose residues to an acceptor molecule. In particular the enzyme catalyzes 
at least one of the following reactions: 

a) addition of hexose p(l— >6) to diacylglycerol, 

b) addition of hexose (3(1 ->6) to a MHexD, 

c) addition of hexose p(l— >6) to a DHexD, 

d) addition of hexose p(l— >6) to a THexD, 

e) addition of hexose P(l->6) to a TeHexD, 

f) addition of hexose p to a ceramide, 

g) addition of hexose P (1— >6) to a monohexosyl ceramide. 



h) addition of hexose P to a sterol, 

i) addition of hexose P(l->6) to a steryl glucoside, 

j) addition of hexose in P-giycosidic linkage to the primary hydroxyl group of 
phosphatidylglycerol, 

k) addition of hexose P(l— >6) to the first hexose of phosphatidylglycerol-P-D- glucoside. 

In paricular the glycosyl transferase catalyzes the successive transfer of one or more hexose 
residues to at least one acceptor molecule for synthesis of glycolipids, in particular 
phosphoglycolipids, in particular catalyzing one of the following reactions: 

a) addition of Glc P(1^6) to a diacylglycerol, 

b) addition of Glc p(1^6) to a MGlcD, 

c) addition of Glc P(l ->6) to a DGlcD, 

d) addition of Glc p(1^6) to a TGlcD, 

e) addition of Glc P(1^6) to a TeGlcD, 

f) addition of Glc p to a ceramide, 

g) addition of Glc P(l— >6) to a monoglucosyl ceramide, 

h) addition of Glc P to a sterol, 

i) addition of Glc P(l— >6) to a steryl glucoside, 

j) addition of Glc in P-(l— >6)-glycosidic linkage to the primary hydroxyl group of 
phosphatidylglycerol, 

k) addition of Glc p(1^6) to the first Glc of phosphatidylglycerol-p-D-glucoside. 

The invention also relates toSXhc DNA sequences coding for an protein having enzyme activity 
of a processive glycosyl transiferase from Bacillus subtilis and/or Staphylococcus aureus. 
Further the invention is directedNp DNA sequences coding for a protein v^hich shows at least 



%, preferably at least 70 %, more preferably at least 90 %, and most preferably at least 



sequence codes for a protein having mow-e than 5 amino acids within the amino acid sequence 
EHQPDIII which are identical with the ahiino acid sequence of the proteins from B. subtilis 
and/or S. aureus, preferably having more man 6 amino acids within the amino acid sequence 
QVVVVCGKN or the amino acid sequenceV)CMITKPG which are identical with the amino 
acid sequence of the proteins from B. subtilis and/or S. aureus. Most preferably, the DNA 




'5 % identity with the deduced protem of ypfP (Clustal X). More particular, the DNA 
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sequence codes &r a protein the amino acid sequence of which comprises the amino acid 
sequence MITKPQGITxTE (wherein x is any amino acid), or the amino acid sequence 
VKxTGIPI (whereiiTK is any AA) or the amino acid sequence of which comprises more than 
5 amino acids within tnfe sequence ZPDIIIxxxP (wherein Z represents the amino acid Q or K 
and X is any amino acid) Vhich are identical to the sequence found in Bacillus subtilis and/or 
Staphylococcus aureus. \ 

The invention also relates to the use of a processive glycosyltransferase for biosynthetic 
production of glycolipids having the following structure: 

a) p-D-Glcp-(1^6)-p-D-Glcp-(l-^6)-p-D -Glcp-(l->6)- p-D-Glcp-(1^6)-Gro, 

b) 3-[0-p-D-Glucopyranosyl]- phosphatidylglycerol (PLl), or 

c) {3-[0-(6"'-0-acyl)-p-D-glucopyranosyl-(l"'^6")-0-p-D-glucopyranosyl]-2-acyl- 
phosphatidylglycerol} (PL2). 

Finally, the invention is directed to secondary products which are produced by 
biosynthetically and gentechnically engineered microorganisms and/or plants using a 
processive glycosyl transferase by further conversion of the products produced by the action 
of the processive sugar transferases, in particular by addition of a fatty acid to the position 6'" 
of the terminal hexose in {3-[0-P-D-glucopyranosyl-(r"-^6")-0-P-D-glucopyranosyl]-2- 
acyl-phosphatidylglycerol} . 



1. Isolation and cloning of ypfP 

ThQypfP gene was isolated from B. subtilis , the gene being described in the SubtiList 
Database as an open reading frame of unknown function (accession number P54166). The 
other gene that was isolated and cloned was a sequence from Staphylococcus aureus 
(accession number Y 143 70) described as an open reading frame of unknown function. 




For DNA isolation, restriction analysis and ligation, standard techniques were used 
(Sambrook et al., 1989). Genomic DNA from Bacillus subtilis 019 was isolated according to 
Cutting et al., 1990. Genomic DNA of S. aureus was provided by Prof. Dr. Witte, (Robert 
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Koch-Institute, Postfach 650280, 13302 Berlin). Restriction endonucleases and DNA- 
modifying enzymes were purchased from New England Biolabs and Boehringer Mannheim, 
and used as recommended by the suppliers. 

E. coli XLl BluA(MRF') (Stragene), E, coli BL21 (DE3) (Novagen) and Bacillus substilis 
019 were grown aX37°C in a Luria Broth (LB) (Sambrook et aL, 1989). For plasmid-bearing 
E. coli strains, the amibiotics ampicillin (100 \xg ml') and kanamycin (30 |ig ml*) were 
included in the medium. The vectors pUC18 (Yanish-Perron et al., 1985) and pET24c(+) and 
pET24d(+)(Novagen) were used as cloning vectors. The ypfP genes were isolated from 
genomic DNA of B. subtnds and S, aureus by PGR. For this purpose the specific primers PJl 
(5'-CCGAGCTCC CAT aVjAATACCAATAAAAGAG 3') and PJ2 (5' TCCGGATCC 
TTACGATAGCACTTTGGy for B. substilis ypJP and the primers PJIO 5' TTCC 
ATGGTTACTCAAAATAAaVaGATATTG 3^ and 

PJl 1 5' TTTGGATCC TTATTtV^CGAAGAATCTTGCATATAA 3' for the S. aureus gene 
{say) were used, the underlined part of which annealed to the 5' and 3' end of the ypfPlsay 
genes. The following amplification ptogram was used: 10 min at 94°C; 30 cycles of 0.5 min 
at SS'^C and 60°C for S. aureus ypJP, r^ectively, 2 min at 72''C, 1 min at 94°C; one cycle of 
10 min at 72°C. Pwo-polymerase (Boehmiger) was used for the amplification of the 1 170 bp 
product of the genomic DNA of B. subtiliXpfu-polymemsG (Stratagene) was used for the 
amplification of the 1 190 bp product from SSaureus genomic DNA. The amplified genes 
were cloned into 5'/waI-linearized pUC18 vectbr, resulting in pypJP3 and psayl. For 
construction of the expression vectors pEypfP 2X and pEsay24, the ypfP fragments were 
released by BamHI and Ndel and Ncol digestion, respectively, from pypJP3 and psayl, and 
inserted into BamHI-, Ndel- and A^col-linearized p]^24c(+) and pET24d(+), respectively. E. 
coli XLl Blue (MRF') was transformed with pypJP3\nd psayl and E. coli BL21 (DE3) was 
transformed with pEypfP24 and pE^ay24. Correct in-frame cloning was confirmed by 
sequencing. One strand of the DNA of pypfP3 and p^aylwas sequenced using the dideoxy 
method (automatic sequencer 373A and 377, Applied Biosystems), For computer analysis of 
the sequences, Clone manager for Windows 4,1 (Scientific and Educational Software) was 
used. Database searches were performed using the BLAST algorithm (Altschul et al., 1990). 
Sequence alignments were performed using Clustal X (Higgins and Sharp, 1988). 



2. Expression of the ypJP/say-genes 



For expression of the genes, ypfP was cloned into pET24c(+) and pET24d(+), respectively, 
and E.coli BL21 (DE3) was transformed with the resulting constructs pEypfP24 and pEsay24. 
Pre-cultures of coli BL21 (DE3), E, coli BL21 (DE3) pEypJP24 and E. coli BL21 (DE3) 
ipEsay2A were grown overnight at 37°C, and expression cultures were started at an optical 
density (O.D.)580 of 0.05. Induction was performed by adding 0.4 mM IPTG at an optical 
density of 0.8 and further incubation for 2 h at 37^C. All subsequent steps were carried out at 
4°C. Cells were collected by centrifugation (15 min, 5000x g). The cell pellet was re- 
suspended in a buffer 1 (50 mM Tris-HCl, pH 8.0; 20% (v/v) glycerol) (4% of the volume of 
the expression culture). The cells were frozen and sonicated after thawing (3x 40s; Braun, 
Labsonic 2000). Inclusion bodies were collected by centrifugation (15 min, 4000x g) and the 
supernatant was then divided into the membrane fraction and the soluble supernatant (5. 
subtilis ypfP) by ultrasonification (1 h, 147000x g). 

The inclusion bodies, the membrane fraction and the soluble supematant were separated in an 
SDS-PAGE. The SDS-PAGE was carried out as described by Laemmli, 1970, and the gels 
were stained with Coomassie brilliant blue R250 (Serva). 

By SDS-PAGE analysis, over-expression of a protein having an apparent molecular mass of 
44 kDa could be identified in the membrane fraction and the inclusion body fraction. The 
molecular weight corresponds to the calculated mass of 43.6 kDa or 44.7 kDa, respectively, 
for YpfP. This protein was not present in the soluble fraction and in untransformed E, coli. 

3. Lipid extraction and analysis 

Expression cultures of E. coli BL21 (DE3) pEypfP24 and pE^ay 24 and cultures of the late 
logarithmic growth stage of Bacillus subtilis 019 were harvested by centrifugation (15 min, 
5000x g), and the sedimented cells boiled for 10 min in water. Lipid extraction was performed 
as described by Linscheid et al., 1997. For separation of individual lipids by preparative 
chromatography, the lipids were subjected to thin-layer chromatography in the following 
solvent systems: (1) chloroform/methanol/H20 (70:30:4. v/v/v) for separation of MGlcD, 
DGlcD, TGlcD and TeGlcD from phospholipids; (2) diethyl ether/petroleum ether (2:1, v/v) 
for separation of acetylated DGlcD from non-acetylated DGlcD; (3) diethyl ether/petroleum 
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for separation of acetylated DGlcD from non-acetylated DGlcD; (3) diethyl ether/petroleum 
ether (4:1, v/v) for separation of acetylated TGlcD from non-acetylated TGlcD; (4) 
chloroform/acetone (9:1, v/v) for separation of acetylated TeGlcD from non-acetylated 
TeGlcD. 

Separation of the two acetylated phosphoglucolipids PLl and PL2 was performed in the 
solvent chloroform/methanol (80:20, v/v). Then both acetylated lipids were extracted from the 
silica gel and re-suspended in chloroform. The lipids were methylated by addition of 
diazomethane and subsequently separated in the solvent toluene/methanol (9:1, v/v). 

Acetylation of the glycolipids was performed as described by TuUoch et aL, 1973. Synthesis 
of the fatty acid methyl esters from DGlcD with sodium methylate was performed according 
to Roughan and Beevers, 1981. Release of the fatty acid from the sn 1 position of DGlcD was 
achieved by incubation with Rhizopus Lipase (Boehringer) according to suppliers' protocol. 
Incubation with Cerebrosidase (provided by Prof. Dr. Sandhoff, University Bonn) was 
performed as described by Vaccaro et aL, 1993. 

The lipid extracts of E. coli BL21 (DE3) ipEypfPlA and pEsay24 showed various new glyco- 
lipids, which could not be detected in the wild-type (Fig. 1). These glycolipids reacted with a 
sugar-specific spray reagent, but they were ninhydrin and phosphate negative (the native PLl 
and PL2 were phosphate positive). One of the glycolipids co-migrated with a diglucosyl 
diacylglycerol (DGlcD) standard of B. cereus. The different glycolipids were purified and 
acetylated. The glycolipid band with the polarity of DGlcD also co-migrated after acetylation 
with the acetylated DGlcD standard of B. cereus, 

4. Analysis of the new glycolipids by MS and NMR 

Mass spectrometric (MS) and nuclear magnetic resonance spectroscopic (NMR) analysis of 
the new glycolipids was exclusively performed with the per-O-acetylated derivatives (1, 2, 3, 
4,) and the phosphomethyl esters (PLl , PL2 ) of the glycolipids, respectively. 



4.1. Mass spectrometric analysis (CI-MS and MALDI-MS) 
4AA. EI-MS and CI-MS (DIP-mode) 

Mass spectrometric analysis of the neutral glycolipids was carried out with a Hewlett Packard 
mass spectrometer (Model 5989) using the direct insert probe (DIP) mode. The sample was 
evaporated from 80°C to 325°C at a rate of 30°C/min. 

While all per-O-acetylated di-(2), tri- (3) and tetrahexosyl-(4) diacylgycerolipids could be 
analyzed by MS analysis in the DIP mode directly, the two phospholipids PLl and PL2 could 
not be analyzed by this technique. Due to the high polarity and the complexity of the 
molecule, the phospholipids were, therefore, dephosphorylated with hydrogen fluoride (48% 
HF, 4°C, 20h) prior to MS analysis, the dephosphorylated fragment was per-O-acetylated and 
only after this treatment analyzed by mass spectrometry (DIP mode). Electron impact spectra 
(EI-MS) were recorded at 70 eV and chemical ionization spectra (CI MS) were obtained using 
ammonia (0.5 torr). 

In the DIP MS analysis all per-O-acetylated di-(2), tri- (3) and tetrahexosyl-(4) ' 
diacylgycerolipids in the EI mode showed characteristic fragments for terminal mono-hexosyl 
(m/z = 331) and di-hexosyl (m/z = 619) residues and differed from each other in the 
evaporation rate maximum (9.5 min, 2, 10.6 min, 3 and 12.0 min, 4). The disaccharid 2 
showed in the CI MS a pseudomolecular ion [M + NPt^]*^ at m/z = 1202, wherein 
hexadecanoyl (16:0) and hexadecenoyl (16:1) could be identified as the fatty acid residues. In 
addition, a second ion [M -i- NH4]'^ at m/z = 1230 was observed, which could be identified as 
disaccharide with 16:0 and 18:1 (or 18:0 and 16:1) as fatty acids. The amounts of these 
differently acetylated diglycosyl lipids were present in a relative proportion of 2:3. 

The trisaccharide 3 showed the expected pseudomolecular ion [M + NH4]'^ at m/z = 1490 and 
1516 with the same heterogeneity in its acylation pattem, however in a slightly different 
proportion (2:1). 

The tetrasaccharide 4 showed an evaporation profile with an increased maximum in the 
evaporation time (12.0 min) in comparison to 2 and 3. Pseudomolecular ions in the CI MS 
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could not be produced with this compound. The presence of the tetrasaccharide 4_could, 
therefore, only be indirectly deduced under these conditions from the characteristic fragments 
of the non-reducing glycosyl residue {m/z = 331 and 619, respectively). 

In the DIP MS (CI mode) both PLl and PL2 showed a characteristic biphasic evaporation 
profile, wherein the first maximum (~ 6 min) from the dephosphorylated and re-acetylated 
partial structures and the second maximum (---11 min) from the diagnostic pyrolysis 
fragments of the intact, not cleaved by HF hydrolysis phospholipids could be assigned. From 
this result it had to be noted that the applied reaction time (6 -20h) for removal of the 
phosphate residue with aqueous (48%) HF was not sufficient, since both phospholipids were 
cleaved only partially into their dephosphorylated partial fragments. In spite of this limitation, 
the fatty acid distribution pattern for both of the glycerol residues Gro^ and Gro" could be 
unambiguously determined. 

For PLl , the first peak revealed two pseudomolecular ions [M + NH4]'*" at m/z = 626 and 654, 
which could be assigned to the molecule ions and, thus, to the mono-acetylated glycerol 
residue Gro" with 16:0 and 16:1 (M = 608) and a 16:0 and 18:1 (or 18:0 and 16:1) (M = 636), 
respectively. Thus, this peak contained the expected product of the HF treatment and re- 
acetylation. In contrast, an analogous reaction product deduced from Gro* (with Glc as the 
substituent) was not observed. 

The second peak (- 11.0 min) revealed three fragments, which in contrast to the first peak 
represented only pyrolysis products of PLl, produced during MS analysis, but no intact 
derivatives. The first ion (m/z = 447) was assigned to a fragment deduced from Gro\ 
substituted with a peracetylated Glc and with an acetyl residue. The two other fragments (m/z 
= 549 and 577) originate from Gro". They carried a diacylglycerol, wherein each a palmitic 
acid (16:0) and a palmitoleic acid (16:1) were esterified (m/z = 549), and a second fragment 
(m/z = 577) with each a palmitic acid (16:0) and an oleic acid (18:1) (alternatively 16:1 and 
1 8:0). Already from this fragmentation pattem it could be concluded that PLl is an 
"asymmetrically acylated" phospholipid, because Gro" carries both fatty acids, and Gro' in the 
native phospholipid with a free hydroxy! group and a glucose appears to be relatively 
hydrophilic. 



- 13 - 



The dephosphorylated and per-O-acetylated PL2 also showed a biphasic evaporation profile 
in the DIP MS analysis. The first peak had an evaporation maximum (- 6 min) and a 
fragmentation pattern identical with PLl , indicating that PL2 is identical with PLl with 
respect to the fatty acid substitution pattern in Gro^^ In contrast, the second peak (1 1 .0 min) 
revealed four fragmentations. The first couple (w/z = 550 and 577) was identified as a 
pyrolysis fragment deduced from Gro" with 16:0 and 18:1 (16:1 and 18:0, respectively) and 
was therefore analogous to PLl. The second couple {m/z = 1202.9 and 1231.1) could be 
identified as [M + NH4]'^ ion of an intact derivative, produced from PL2 after de- 
phosphorylation and re-acetylation. This PL2 derivative is a diglucosyl diacylglycerol being 
further esterified with two fatty acids 16:0 and 16:1 (M = 1 185) and 16:0 and 18:1 (M = 1203) 
respectively, as well as an acetyl residue (on the initial phosphate position). Although it was 
possible to assign the fatty acid substitution pattern to the two glycerol residue Gro' and Gro" 
by these analyses, the exact substitution pattern of the fatty acids could only be determined by 
NMR analysis (see below). Due to thermal instability of the molecules, the DIP MS analyses 
of the two intact phospholipid derivatives PLl and PL2 could not be completely analyzed and 
were therefore further examined by means of MALDI-TOF-MS analysis. 

4.1.2. MALDI-TOF-MS 

MALDI-TOF-MS analyses were carried out on a Bruker Reflex II spectrometer in the 
reflector mode at an acceleration potential of 20 kV by means of the "delayed ion extraction" 
in the positive mode. The per-O-acetylated and phosphomethylated samples of PLl and PL2 
were re-suspended in chloroform (10|ig/ml) and 2 |al solution thereof were mixed with 2 |al of 
a matrix solution (0.5 M 2,4,6-trihydroxyacetophenone; Aldrich, Steinheim). An aliquot of 
this mixture (0.5 |il) was applied to a metal support, dried with warm air and immediately 
thereafter placed into the spectrometer. Calibration of the spectra was performed using an 
internal standard (Angiotensin). AH mass data apply exclusively to the monoisotopic mass of 
the molecules. 

In each case, two derivatives of both phospholipids were analyzed by MALDI-TOF-MS: the 
free phosphoric acid derivatives and the phosphomethyl esters (PLl and PL2, see Fig. 2). The 
free phosphoric acid derivative of PLl showed prior to esterification of the phosphate residue 
(diazomethane) in the positive reflector mode a pseudomolecular ion [M-H+Na]"^ at m/z = 
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1 1 16,48, corresponding to the calculated formula C54H95O20P (M = 1094,56). The non- 
esterified phosphoric acid derivative of PL2 showed under the same conditions a 
pseudomolecular ion [M-H+Na]"*" at m/z = 1796,43, corresponding to the formula 
C94H167O28P (M ^ 1775,06) and thus carrying in addition a hexose and a fatty acid (16:0 
and/or 18:1) in comparison to PLl . 

Note: The mass of the MALDI-TOF-MS analyses given herein in all cases only relate to the 
smallest monoisotopic pseudomolecule ion or mass fragment. That means, in all mass data 
only the smallest fatty acid (16:0) was considered (see Fig. 2). All pseudomolecular ions 
presented herein showed per fatty acid always a heterogeneity resulting from the exchange of 
16:0, 16:1, 18:0 and 18:1, this heterogeneity influencing all mass spectra (DIP and MALDI), 
but is not considered in the mass data provided herein. 

PLl showed after treatment with diazomethane a pseudomolecular ion [M-H+Na]^ at m/z = 
1 130,69 which corresponds to the formula C55H97O20P (M = 1 108,57) and thus contains only 
one additional methyl group (Dm/z = 14) in comparison to the free acid. PLl showed under 
the same conditions a pseudomolecular ion [M-H+Na]^ at m/z =1811 ,42, which corresponds 
to the formula C95HI69O28P (M = 1789,07) and also contains only one additional methyl 
group (Om/z = 14) in comparison with the free acid. Thus, not only the preceding MS 
analyses (DIP MS) were confirmed, but it was also unambiguously demonstrated that both 
PLl and PL2 represent phosphodiesters, which are likely to be substituted with two glycerol 
residues. In both cases only one methyl group was introduced by a diazomethane treatment 
and transformation into the corresponding methyl ester. 

4.2. Proton Nuclear Magnetic Spectroscopic Analysis (^H-NMR) 

The per-O-acetylated and purified samples (2 -4 , 30-200 ^ig) were dissolved in 100 |al CDCI3 
(99.96% Cambridge Isotope Laboratories, Andover, MA, USA), and transferred into special 
capillary NMR microtubes (2.5 mm OD, Wilmad, Buena, NJ, U.S.A.). The proton spectra 
(^H-NMR) were recorded on a 600 MHz Spectrometer (Bruker Avance DRX 600), equipped 
with a special microprobe head (PH TXI 600SB). The samples were measured at 300K with 
reference to internal trimethlysilane (TMS, 5h = 0.000 ppm). One- and two-dimensional 
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homonuclear spectra (^H, 'H COSY, ROESY, and relayed COSY) were performed using 
standard Bruker software (XWINNMR, Version 1 .3). 

The one-dimensional (ID) ^H-NMR spectra (600 MHz, microprobe head) of the di-(2i - 200 
|ag), tri- (3 ^ 200 |ig), and tetrahexosyl diacylglycerolipids (4_^ 50 ng) of compounds 2, 3, 
and 4_are shown in Figures 3 a -c and the results are set forth in Table 1 (Armex). 

Assignment of the signals was carried out by ID and two-dimensional (2D) proton nuclear 
magnetic resonance spectrometry (*H, *H COSY, relayed ^H, *H COSY, ROESY) in 
comparison with the structurally related p-gentiobiose octaacetate (T) which served as a 
reference substance for unambiguous assignment and which is therefore also included in 
Table 1. The P-anomeric configuration of all hexoses in the substance results from the 
coupling constants J\^2 being between 7.6 and 8 Hz for all glucoses. The other coupling 
constants of the pyranosidic ring protons H-2, H-3, and H-4 and H-5 (J2,3, Jsa ^4,5) were all 
larger than 9.5 Hz, indicating glucopyranose. The chemical shift of the methylene protons (H- 
6a and H-6b) as well as their coupling constants (J6a, 6b) in the terminal Glc residue (A) were 
found to be identical in all oligosaccharides (4.062 ±0.005 ppm for H-6a and 4.205 ±0.005 
ppm for H-6b) as compared with those of H-6a,6b in residue A of the P-gentiobiose 
octaacetate, thus allowing the assignment of the spin systems of all terminal Glc residues A in 
the oligosaccharides 2, 3, and 4 on the one hand, but also in PLl and PL2 . 

The p(1^6)glycosidic bond could be determined by means of the shift towards a higher field 
of the (overlapping) signals of H-6a and H-6b in residues B, C and D (3.855 ±0.05 ppm) since 
these signals clearly differed from the non-substituted methylene signals of the terminal H- 
6a,6b (A). This fact clearly indicates that all Glc residues of the compounds 2, 3, and 4_are 
identically, i.e. P(1^6) glycosidically interlinked. This observation could be confirmed by 
means of a two-dimensional spectrum (^H, *H COSY, Figure 4, bottom) and a nuclear 
Overhauser spectrum (rotating-fi-ame NOE spectroscopy, ROESY, Figure 4, above) of 
trisaccharide 3. A ROESY spectrum showed (indicated) cross-peaks of the anomeric H-1 
protons H-1 ^ H-1 ^ , and H-1 ^ which could be observed between H-1^ /H-6a,6b^, H-1^ /H- 
6a,6b*^, and H-IC /H-3a,3b^*^° (Fig. 5), allowing an unambiguous assignment of the three spin 
systems to each of the glucosyl residues A, B and C. 
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In addition to the glycosyl residues in all *H-NMR spectra, signals of the glycerol moiety ( H- 
la, Ib^'^ , and H-3a,3b^^ ) could also be identified (Fig. 3, Table 1). The fatty acids 

showed the expected methylene (-CH2-, 1 .185 ppm) and methyl protons (-CHs^ 0.812 ppm). 
Finally, signals from olefinic protons (-CH =CH-, ^5.27 ppm) could also be found in all 
glycolipids 2-4, PLl and PL2, which from the MS spectra could be assigned to the 
unsaturated fatty acids 16:1 and/or 18:1. 

In accordance with the non-phosphorylated compounds 1 - 4_, NMR analyses were carried 
out only with the per-O-acetylated mono methyl ester derivatives. Samples (0.1 - 0.2 mg) 
were dissolved in 500 |al CDCI3 (99.96 % Cambridge Isotope Laboratories, Andover, MA, 
U.S.A.) and measured in 5 mm NMR tubes (Ultra Precision NMR sample tubes , Isocom, 
Landshut) at 300 K. Proton and Phosphorous-3 1 spectra (*H- and ^^P NMR) were recorded 
with a 600 MHz spectrometer (Bruker Avance DRX 600) equipped with an inverse probe 
head (5 mm TXI 13C), and the carbon 13 (^^C NMR) spectra were recorded with 360 MHz 
Bruker AM spectrometer (5n[Uii dual probe head) at 90,6 MHz. The chemical shift was 
measured with reference to intemal tetramethylsilane (TMS, 5h = 0.000 ppm) and chloroform 
(CHCI3, 6c = 77.00 ppm), respectively. ^*P NMR spectra were recorded at 242.9 MHz and 
calibrated with reference to an extemal standard (85% H3PO4 = 0.0 ppm). One (ID) and two- 
dimensional (2D) homonuclear spectra (*H, *H COSY, NOESY, and relayed COSY) and 
heteronuclear spectra [*H, *^C and *H, ^*P HMQC ( heteronuclear multiple quantum 
coherence) as well as ^H, *^C HMBC (hetero multiple bond correlation)] were recorded with a 
standard Bruker software (XWINNMR, Version 1.3). 

In the ^H-NMR spectrum (Fig. 6 and Table 2), PLl showed, in accordance with MGlcD: {3- 
[O-p-D- glucopyranosyl]-l,2 diacylglycerol}, characteristic signals corresponding to a p- 
glycosidically bonded Glc residue. Surprisingly, the anomeric proton H-1 was cleaved into a 
couple of signals (H-1 and H-1') with a similar intensity (H-1, 4.461 ppm, Ji,2 7.9 Hz; H-T , 
4.457 ppm, Ji,2 7.9 Hz. While the other protons (H-2, H-3, H-4, H-5, H-6a,b) showed identical 
chemical shift and coupling constants as compared with MlcD, the protons of H-3 a ^ and 
H-3b ^'"^ ' (3.62 and 3.88 ppm) on the one hand and H-la " and H-lb " (4. 1 1 and 4.3 1 
ppm) on the other hand were split: fine resolution of the two other methylene proton signals 
of the glycerol residues I and II (H-la,b*^'°' and H-3a,b^'''" 4.08 - 4.14 ppm) could, 
however, not be observed. Further, we observed two singular methine protons for H-2*^^° ^ 
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(ddd, 5.082 ppm, 5.3 Hz) and H-2^'°" (ddd, 5.168 ppm, 5.3 Hz) as to be expected for a 
diglyceride. 

Further characteristic signals were of the methyl group of the phosphate ester, which also 
exhibited a characteristic spUt doublet (POCH3, 3.828 and 3.810 ppm) with a characteristic 
Jh,p coupling of 1 1 .2 Hz. A phosphate monomethylester, PLl could be identified as a 
phosphodiester via the integral of the signal of the phosphomethyl group (3H). This 
confirmed the results of MS analyses. Finally, 5 OAc signals could be detected (2.026, 2.018, 
1 .989, 1.954, 1.934 ppm; all s), which led to the conclusion that, besides the four OAc groups 
of the terminal Glc residue, presumably a fifth OAc group was bound to one of the two 
glycerol residues. The accurate fatty acid distribution pattem could be partially determined 
using an HMBC experiment. A fatty acid in position sn-l of glycerol residue II could be 
assigned via the connectivities of the a-methyl protons of the fatty acids (-O-COCH2-). 
However, due to the small amount of substance, the substitution of the second fatty acid could ' 
not be determined in the HMBC experiment and could only be investigated based on MS 
analyses. 

In ^^P NMR (Figure 7 and Table 3), the phosphate signal of PLl is split (0.514 and 0.444 
ppm) and appears as a singlet in the decoupled spectrum. The *H,^'P-HMQC experiment (i) 
showed the expected connectivity with the phosphomethyl ester group (3.828 and 3.810 
ppm), and (ii) revealed two methylene protons of glycerol residue I (H-la,b^''° ^ and H-3a,b*^''° 
")(- 4.08 - 4.14 ppm) to which the phosphomethyl group is bound. Hence, the phosphate 
substitution could be determined. Thus, based on this experiment, the connection of glycerol 
residues I and II via a phosphate diester could be proven, which could already be assumed due 
the presence of phosphomonomethyl ester in NMR analysis and due to characteristic 
fragments in MS analysis. 

The splitting of the signals of H-1*^'^ H-3a,b*^'''^ and H-la,b^''*" is especially notable. This 
anomaly in *H NMR can be explained by the presence of a pair of diastereomers of PLl . By 
introducing a methyl group, the prochiral phosphate (R-O-PO(OH)-OR') in the middle of the 
molecule becomes chiral (R-O-PO(OMe)-OR') which resuhs in two diastereomeric 
phospholipids PLl and PL1\ A corresponding chirality of the phosphorous atom was already 
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observed and described for other phospholipids in ^H-, ^^C and P-NMR spectra (Bruzik et 
al., 1983). 

Phospholipid 2 (PL2) showed the same characteristic splitting of two anomeric protons in ^H- 
NMR. In this case, the anomeric protons are of the two glucose residues Glc"^ 
and Glc^ (H-1^ and H-T^ ; 4.647 and 4.635 ppm, Ji,2 7.7 Hz) and H-1^ and H-1'^ (4.533 and 
4.524 ppm, J\;i 1,9 Hz). These splittings are characteristic for diastereomeric pairs analogous 
to PLl . Thus, a structural relationship of both phospholipids and, as a consequence, a 
correlation in the biosynthesis of these phospholipids can be assumed. The ^H-NMR spectrum 
of PL2 showed high similarity with the one of DGlcD (2) which made the interpretation of 
the NMR spectra and thus the structural analysis easier. By comparison of both *H-NMR 
spectra, the substitution of the fourth acyl residue (16:0 or 18:1) in position of terminal 
glucose could be determined, (The third acyl residue is connected with the C2 group of Gro'). 

Further characteristic signals were of the phosphomethyl ester, which also exhibited a split 
doublet revealing the chirality of the phosphate residue, and thus the diastereomeric nature of 
the molecule (3.835 and 3.81 8 ppm, Jp,h 1 1 .2 Hz). In ^'P-NMR, the phosphate signal of PL2 
was also split (0.414 and 0.275 ppm) analogous to PLl , which is characteristic for the 
diastereomeric pair PL2 and PL2' (Bruzik et al., 1983). 

In conclusion, our MS (DIP and MALDI) and 1H-, 13C- and 31 P-NMR analyes 
unequivodcally identified three neutral and two inogenic glyolipids, which could be identified 
as di-, tri-, and tetrasaccharide-diacylglycerols 2, 3, and 4 with the following structure in the 
glycosyl moiety (Figure 8): 

P-D-Glcp-( 1 ^6)-P-D.Glc;7-( 1 -^6)-Gro (2) (DGlcD), 
P-D-Glcp-(1^6)-p-D-Glc/?-(l->6)-p-D-Glcp-(1^6)-Gro (3) (TGlcD), and 
p-D-Glc/?-( 1 ^6)-p-D-Glc/7-( 1 ->6)-P-D-Glc/?-( 1 ^6)-P-D-Glc;7-( 1 ^6)-Gro (4) (TeGlcD), 
as well as the phospholipid l:3-[0-P-D-glucopyranosyl]-^n-glycerol-13'-phospho-r,2'- 
diacyl-j:«-glycerol (PLl) and the phospholipid 2: {3-[0-(6'"-0-acyl)-p-D-glucopyranosyl- 
(1 "'— >6")-0-p-D-glucopyranosyl]-2-acyl-5A7-glycerol-l ,3'-phospho-l '2'-diacyl-5«-glycerol} 
(PL2) (Figure 9). 
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5. Enzyme Assay 

Standard enzyme assays for determination of the activity of the processive glycosyl 
transferase were performed in a final volume of 100 containing buffer 1, 20 |il E. coli 
BL21 (DE3) pEypfP24 and pEsay24 membrane fraction (20-40 ^ig of protein) and 250 000 
dpm UDP-[^'^C]-glucose (specific activity 10.8 GBq/mmol; 3.85 \xM final concentration). The 
reaction was carried out for 1 h at 30°C and stopped by the addition of chloroform/methanol 
(2:1; 2 ml). The organic mixture was washed with 0.7 ml of NaCl solution (0.45% (w/v)) and 
the resultant subphase recovered. An aliquot of the subphase was subjected to scintillation ' 
counting, and after removal of the solvent by evaporation with argon, the remaining part was 
used for separation by thin-layer chromatography. 

Detergents such as octyl-p-D-glucopyranoside (Sigma), decyl-P-D-glucopyranoside ■ 
(Sigma),SDS, Chaps (Signa), Tween 20, dodecyl-P-D-maltoside (Sigma) and sodium cholate 
(Sigma) were added in concentrations according to twice their critical micellar concentration 
(this applies only to the processive glycosyl transferase from B. subtilis ypJP), 

Ceramide was added as fluorescent D-erythro-C6-NBD-ceramide (Matreya, INC.), 
cholesterol was added as [4-''^C Jcholesterol and steryl glucoside was added as ^'^C-labelled 
steryl glycoside (Fig. 10/11). Radioactive products on thin-layer chromatography plates were 
detected by radio scanning (BAS-1000 Bio Imaging Analyzer, Fuji) (see Figure 10). 

Assays with UDP-[^'*C]glucose showed the highest incorporation of radioactivity with 
membrane fractions compared with the soluble and inclusion body fractions of E. coli BL2 1 
(DE3) pE>5^7/P24 and pEsay24. Therefore, all subsequent in vitro standard assays were carried 
out with membrane fractions and UDP-[^'*C]glucose. The [^"^Cl-labeled lipophilic products 
counted for 70 -80 % of the label offered in the assay. Separation by TLC was used to 
identify lipophilic radioactive products, using a monogalactosyl diacylglycerol (MGD), 
DGlcD and TGlcD as non-radioactive standards. The highest proportion of radioactivity was 
found in DGlcD, whereas labeling of MGlcD and TGlcD (TeGlcD only for B. subtilis) was 
low (Fig. 1 1). Assays with membrane fractions of the untransformed E. coli did not show 
incorporation of radioactivity into lipophilic products. To increase the DAG concentration in 
the enzyme assay, the effects of several detergents on the enzymatic activity were tested. With 
the exception of lyso-PC (Sigma) and alkyl-p-D-glucopyranosides, the addition of all above- 
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mentioned detergents resulted in complete inhibition of enzymatic activity. [ C]-MGlcD and 
[^'^CJ-DGlcD from assays with transformed E. coli were isolated and subjected to various 
chemical and enzymatic treatments to identify their structure. 

The DAG moiety in ['^C]DGlcD was confirmed by treatment with Rhizopus lipase. This 
lipase specifically releases the fatty acid from the sn 1 -position of the DAG-containing lipid. 
As expected, the resulting radioactive product co-migrated with a lyso-DGlcD that had been 
prepared from non-radioactive DGlcD by the same treatment. Incubation of [^'^CJDGlcD with 
sodium methylate resulted in the release of a free fatty acid methyl ester and [^'^C] glucosyl 
diacylglycerol, the same products were produced when using non-radioactive DGlcD of 
known structure. Characterization of the linkage between the first glucose and the DAG was 
carried out by incubation of the labeled MGlcD with cerebrosidase. This enzyme is specific 
for the p-glucosidic linkage, but is relatively unspecific for the hydrophobic part of its 
substrate (Vanderjagt et al., 1994). The incubation of [^"^C] glucose-labeled MGlcD with 
cerebrosidase resulted in the release of labeled glucose and unlabeled DAG. The success of 
the hydrolysis was measured by scintillation counting of the aqueous and organic phase after 
phase partitioning. 90% of the label was found in the aqueous phase as compared with 1 5% in 
the control experiment, in which 85% of the radioactivity was recovered as [^'^ClMGlcD in 
the organic phase. These results support the assumption of a P-glucosidic linkage between the 
first glucose and DAG in MGlcD. 

6. Characterization of glycosyltransferase activity 

The formation of three different radioactive products in the in vitro enzyme assay raises the 
question whether all of these products are produced by a single enzyme coded hy ypj? genes. 
To answer this question, three of the possible sugar acceptors were incubated separately in 
labeled form with unlabeled UDP-glucose in the presence of the membrane fraction. The 
sugar acceptors were isolated from previous assays. Assays with radioactive ['"^CJDAG 
[^^C]MGlcD and [*^C]DGlcD were performed by sonification of the radioactive substrates in 
0.5 mN lyso-phosphatidylcholine (for ['"^CJDAG) or in ethanol before adding the membrane 
fraction, buffer 1 and UDP-glucose (3.6 mM final concentration). The maximum ethanol 
concentration in assays was 5 % (v/v). After conversion of the substrates, the lipophilic 
products were separated by TLC and detected by radio scanning (Fig. 12). ["•C]DAG was 
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converted to [^^C]DGlcD and [*^C]TGlcD, [^^C]MGlcD to ['"^ClDGlcD and ['"^ClTGlcD and 
[^'^ClDGlcD to [^'^ClTGlcD. Conversion of radioactive labeled DGlcD to TGlcD did not occur 
any more with the S, aureus enzyme. Control experiments using the same substrates and 
untransformed E. coli membrane fractions did not result in any of the mentioned products. 
The results suggest processivity of the enzyme, whereby the starting reaction can be described 
as a UDP-glucose: l,2-diacylglycerol-3-P-D-glycosyl transferase reaction. In subsequent 
reaction steps, however, the glucose acceptors vary and represent the products of previous 
additions of p-glucosyl residues. 

To exclude a reaction mechanism based on the transfer of glycosyl residues from glycosides 
to various acceptors, as observed for glycosidases, the enzyme assay was carried out in the 
presence of a radioactively labeled MGlcD, but in the absence of UDP-glucose. No 
conversion of a radioactively labeled MGlcD could be observed. Incubation of YpfP with the 
glucosidase inhibitor deoxynoijrimycin (Alexis Deutschland GmbH) and substance 3 
(provided by Dr. Y. Ichikawa) was performed as described by Ichikawa and Igarashi, 1995. 
These compounds interfere with the transfer of glucose in reactions catalyzed by glucosyl- 
hydrolases, but not with the transfer of a sugar nucleotide-dependent glucdsyltransferases. 
None of the inhibitors was able to inhibit the enzyme reaction. Both approaches suggest a 
transfer of glucose by a sugar nucleotide-dependent reaction. On the other hand, ricinoleic 
acid and oleic acid were able to inhibit the enzyme, inhibition varying with the concentration 
in the assay. Additions between 25 and 50 |ig in 100 |al assay volume resulted in inhibition of 
DGlcD and TGlcD formation, the second and third step of the enzyme reaction. In these 
experiments, MGlcD accumulated in the assay, whereas MGlcD accumulated in normal 
assays in a low amount. Concentrations above 50 |ig in the assay led to a complete inhibition 
of the enzyme. Hydrolysis experiments with sodium methylate excluded the possibility that 
ricinoleic acid (=12-D-hydroxy-oleic acid) was glucosylated. 

7. Substrate specificity 

Substrate specificity was characterized regarding the sugar donor and the sugar acceptor. 
Apart from UDP-[*^C]glucose, UDP-[^^C]galactose was also tested, but galactose was not 
incorporated into lipophilic products. Experiments concerning the sugar acceptor showed that 
besides DAG, MGlcD and DGlcD also alkyl-p-D-glucopyranosides can serve as acceptor 
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(this applies only for the B. subtilis enzyme). This resulted in products, which tentatively have 
been identified as alkyl diglucosides. However, the only evidence available so far are the Rf- 
values of the resulting products and their stability towards alkaline hydrolysis. Neither alkyl- 
a-D-glucopyranoside nor alkyl-p-D-glucopyranoside could serve as acceptor. The S. aureus 
enzyme could convert sterol, as well as steryl glucoside (Figure 10). This data shows that the 
YpfP enzymes are less specific concerning the sugar acceptor, but have a higher specificity 
for the sugar donor UDP-glucose. 

General cloning and transformation techniques 

The recombinant DNA molecules according to the invention can be produced by standard 
techniques, as, for example, described in the Laboratory Manual by Sambrook et aL, vide 
supra. Also, production of transgenic cells and organisms can be performed using 
conventional transformation methods, well-knovm in the art. This applies to microorganisms 
and yeast, as well as to plants. For introducing DNA into a plant host cell several techniques 
are available and the person skilled in the art can easily choose a suitable transformation 
procedure. These techniques comprise the transformation of plant cells with T-DNA using 
Asrobacterium tumefaciens or Asrobacterium rhizosenes as transformation means, fusion of 
protoplasts, direct gene transfer of isolated DNA into protoplasts, microinjection or 
electroporation of DNA, introducing DNA via biolistic methods and other procedures. In an 
altemative embodiment of the invention the nucleic acid molecules of the invention can be 
introduced into plant cells via viral infection. These techniques are all described in the 
literature, as are suitable binary vectors and expression vectors. 

For constructing the recombinant nucleic acid molecules according to the invention, the 
skilled person can use any DNA sequence that codes for a protein having the biological 
activity of a processive diacylglycerol glycosyltransferase, including DNA sequences which 
hybridize with the DNA sequences disclosed herein. In the context of the invention, the term 
"hybridization" means a hybridization under convential conditions, preferably under stringent 
conditions, as e.g. described in Sambrook et aL (1989), Molecular Cloning: A Laboratory 
Manual, 2nd edition. Cold Spring Harbor Laboratory Press, Cold Spring Harbor, New York. 
Nucleic acid molecules that hybridize with the molecules of the invention may isolated e.g. 
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from genomic or cDNA libraries. Identification and isolation of such nucleic molecules can 
be carried out using the nucleic acid molecules of the invention or fragments or reverse 
complements thereof, e.g. by hybridization under standard conditions (Sambrook et aL, 
supra) . For instance, nucleic acid molecules which display exactly or essentially the nucleic 
acid sequences of the invention or portions thereof can be used as hybridization probes. Also 
synthetic fragments, which are synthesized using common synthesis processes and which 
correspond basically to one of the DNA sequences or nucleic acid molecules of the invention 
can be used as hybridization probes. When genes are identified and isolated, which hybridize 
to the DNA sequences of the invention, it is necessary to determine their sequence and the 
sequence and characteristics of the proteins encoded by them. The man skilled in the art has a 
variety of biochemical, biotechnological and genetic engineering methods for the 
characterization of the nucleic acid molecules and the proteins at his disposal. The molecules 
that hybridize to the nucleic acid molecules of the invention comprise also fragments and 
(degenerated or allelic) derivatives of the nucleic acid molecules described herein. The terms 
"derivatives" means in this context that the sequences of such molecules differ from the 
molecules of the invention (as herein described) in one or more positions and show a high 
degree of homology to the sequences provides by the invention. Homology means an identity 
in sequence of at least 40 %, in particular of at least 60 %, preferably of more than 80 % and 
more preferably of more than 90%. The deviations from the sequences of the invention may 
be generated by deletion, addition, substitution, insertion or recombination. 



Abbreviations 




AA 


Amino acid 


DAG 


Diacylglycerol 


DGlcD 


Diglucosyl diacylglycerol 


DHexD 


Dihexosyl diacylglycerol 


DNA 


Deoxyribonucleic acid 


Glc 


Glucose 


MGD 


Monogalactosyl diacylglycerol 


MGlcD 


Monoglucosyl diacylglycerol 


MHexD 


Monohexosyl diacylglycerol 


PAGE 


Polyacrylamide gel electrophoresis 


PG 


Phosphatidylglycerol 


SDS 


Sodium dodecyl sulfate 


TeGlcD 


Tetraglucosyl diacylglycerol 


TGlcD 


Triglucosyl diacylglycerol 
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THexD Trihexosyl diacylglycerol 

TeHexD Tetrahexosyl diacylglycerol 

PLl Phospholipid 1 

PL2 Phospholipid 2 



Nucleotide Sequence 
B. suhf^lis ypfP 

ttgaatacca ataaaagagt attaattttg actgcaaatt acggaaatgg acatgtgcag gtagccaaaa cactttatga 
acaatgtgta cggctcggct ttcagcatgt aacagtttct aatttgtacc aagagtcaaa tccgattgtt tcagaggtaa 
ctcaatacct ttatttaaaa agcttctcaa tcgggaaaca gttttatcgt ttgttttatt acggagttga caaaatctat aataaacgta 
aattcaatat ttactttaaa atgggtaata aaagattggg cgaacttgtc gatgaacatc agcccgatat tattattaat 
acatttccga tgatcgtcgt gccggaatac agacgccgaa ctggaagagt cattcctacc ttcaacgtta tgactgattt 
ttgtcttcat aaaatttggg ttcacgaaaa cgtggataaa tattatgtgg cgacagatta cgtgaaggaa aaactgctgg 
agatcggcac tcatccaagc aatgtaaaaa tcacaggaat tccaatcagg ccgcaatttg aagaatccat gcctgttggc 
ccgatatata aaaagtacaa tctttcacca aacaaaaaag tgcttctgat catggcaggt gctcacggtg tattaaagaa 
cgtaaaagag ctgtgcgaaa accttgtcaa ggatgaccaa gtgcaagtag ttgtcgtgtg cgggaaaaat acggctttaa 
aagaatcttt gagtgcgctt gaagcggaaa atggtgacaa attaaaagtt ctgggctatg tggagcgcat tgatgagcta 
tttcggatca cagattgcat gattaccaag cccggcggca ttactttgac agaagccaca gccattggag tgcctgtcat 
tctgtacaaa cccgtgcctg gccaggaaaa agaaaatgca aacttctttg aagaccgcgg agctgccatc gttgtgaacc 
gtcatgaaga gattctcgag tcagtcactt cccttcttgc agatgaagat accttgcatc gcatgaagaa aaacattaag 
gaccttcatt tagcaaactc ctctgaagtg attttagagg atatcctgaa ggaatcagaa atgatgaccg ccaaacaaaa 
agccaaagtg ctatcgtaa 

S. aureU^ypfP 

Atggttactca aaataaaaag atattgatta ttactggctc attcggtaac ggtcatatgcaagttacaca gagtatcgtt 
aatcaactta atgatatgaa tctagaccat ttaagcgtcattgagcacga tttatttatg gaagctcatc caattttgac 
ttctatttgt aaaaaatggt atatcaatag ctttaaatat tttagaaata tgtacaaagg gttttattac agccgcccag 
ataaactaga caaatgtttt tacaaatact atggacttaa taagttaatt aatttattgataaaagaaaa gccagattta 
atattattaa cgtttcctac accagttatg tcggtactaa ctgagcaatt taacattaat attccagttg ctacagtgat 
gacagactat cgcttacata aaaactggat tacgccgtat tcaacaagat attatgtggc aacaaaagaa acgaaacaag 
acttcataga cgtaggtatt gatccttcaa cagttaaagt gacaggtatt cctattgata acaaatttga aacgcctatt 
aatcaaaagc agtggttaat agacaacaac . ttagatccag ataagcaaac tattttaatg tcagctggtg catttggtgt 



-25 - 



atctaaaggt tttgacacga tgattactga tatattagcg aaaagtgcaa atgcacaagt agttatgatt tgtggtaaga 
gcaaagagct aaagcgttct ttaacagcta agtttaaatt aacgagaatg tatttgattc taggttatac caaacacatg 
aatgaatgga tggcatcaag tcaacttatg attacgaaac ctggtggtat cacaataact gaaggtttcg cccgttgtat 
tccaatgatt ttcctaaatc ctgcacctgg tcaagagctt gaaaatgcct tttactttga agaaaaaggt tttggtaaaa 
cgctgatac tccag 

Amino acid sequence 
B. su^lis YpfP 



MNTNKRVLIL TANYGNGHVQ VAKTLYEQCV RLGFQHVTVS NLYQESNPIV 
SEVTQYLYLK SFSIGKQFYR LFYYGVDKIY NKRKFNIYFK MGNKRLGELV 
DEHQPDIIIN TFPMIVVPEY RRRTGRVIPT FNVMTDFCLH KIWVHENVDK 
YYVATD YVK EKLLEIGTHPS NVKITGIPIR PQFEESMPVG PIYKKYNLSP 
NKKVLLIMAG AHGVLKNVKE LCENLVKDDQ VQVVVVCGKN TALKESLSAL 
EAENGDKLKV LGYVERIDEL FRITDCMITK PGGITLTEAT AIGVPVILYK 
PVPGQEKENA NFFEDRGAAI VVNRHEEILE S VTSLLADED TLHRMKKNIK 
DLHLANSSEV ILEDILKESE MMTAKQKAKV LS 

S. aureus YpfP 

MVTQNKKILI ITGSFGNGHM QVTQSIVNQL NDMNLDHLSV lEHDLFMEAH 
PILTSICKKW YINSFKYFRN MYKGFYYSRP DKLDKCFYKY YGLNKLINLL 
IKEKPDLILL TFPTPVMSVL TEQFNINIPV ATVMTDYRLH KNWITPYSTR 
YYVATKETKQ DFIDVGIDPS TVKVTGIPID NKFETPINQK QWLIDNNLDP 
DKQTILMSAG AFGVSKGFDT MITDILAKSA NAQVVMICGK SKELKRSLTA 
KFKLTRMYLI LGYTKHMNEW MASSQLMITK PGGITITEGF ARCIPMIFLN 
PAPGQELENA FYFEEKGFGK lADTPEEAIK IVASLTNGNE QLTNMISTME 
QDKIKYATQT ICRDLLDLIG HSSQPQEIYG KVPLYARFFV K 
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FIGURES: 

Figure 1 . Expression of the bacterial processive glycosyltransferases results in the 
biosynthesis of glycolipids in E. coli transformants. Lipid extracts were separated by TLC in 
chloroform: methanol: H2O (70:30:4). Total lipids were detected with ANS under UV and 
marked with pencil, glycolipids were detected with a-naphthol/H2S04 and tentatively 
identified by co-chromatography with standards. 

lane \ \ E. coli BL21 (DE3), control 

lane 2: E. coli BL21 (DE3) ipEypfP2A, expressing the B, subtilis gene 
lane 3: B. subtilis lipid extract 

lane 4: E. coli BL21 (DE3) pE5'ay24, expressing the S. aureus gene 
lane 5: standards, MGD, DGlcD and TGlcD 

Figure 2. Negative ion mode MALDI-RE-TOF mass spectra of PLl (top) and PL2 (bottom). 
The dominating molecular species of PLl (top, [M-H]+ at m/z = 909) contains palmitic (16:0) 
and vaccenic acid (18:) residues, whereas the prevaling species of PL2 (bottom, [M-H]+ at 
m/z=1541) contains two palmitic (16:0) one palmitoleic (16:1) and one steraic acid (18:0) 
residue. The other species are described in the text. 

Figure 3. Partial ^H-NMR spectra (600 MHz, CDCI3, 300K) of per-O-acetylated di-(2), tri- 
(3), and tetraglucosyl diacylglycerol (4). 

Figure 4. Part of a 2D ROESY (upper) and 2D COSY (lower) spectrum (600 MHz, CDCI3, 

300K) of per O-acetylated triglucosyl diacylglycerol 3. NOE cross peaks used to assign the 
inter-residual connectivities are indicated in the ROESY spectrum as well as cross-peaks in 
the COSY spectrum. The corresponding parts of the ID 'H-NMR spectrum are displayed 
along the axes. 

Figure 5. Part of 2D ROESY (bottom) and 2D COSY(top) spectra (600 MHz, CDCI3, 300K) 
of PLIacMc. The corresponding parts of the ID *H-NMR spectrum are displayed along the 
axes. The ROESY spectrum shows the connectivity between the anomeric proton H-l"^ of 
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Glc"^ and the methylene protons H-3a*^°^ and H-3b*^^^^. A mixing time of 250 ms was used in 
the ROESY experiment. 

Figure 6. Part of a ^H-NMR spectrum (600 MHz, CDCI3, 300K) of MGlcDAc(a), PLUcMeXb) 
DGIcDac(c), and PL2Ac,Me(d)- Only those signals are indicated which are split due to the 
chirality in the phosphate group when compairing a/b and c/d. Despite their distance to the 
phosphate group a paticularly large effect is seen for the anomeric protons of the disaccharide 
moeity of PL2. Spectra were apodized by Gaussian multiplication with LB —1 .5 and GB 0,2 
prior to Fourier transformation. 

Figure 7. Proton-decoupled ^^P-NMR spectra (242.9 MHz, CDCI3, BOOK) of PLIacMc and 
PL2Ac,Me- Two different phosphate resonances for each pair of diastereomeric phospholipids 
(P, P') of PLl (top) and PL2 (bottom) are indicative of the chiral phosphate group in both 
phospholipids. 

Figure 8. Structures of MGlcD, DGlcD, TGlcD, TeGlcD. The numbers underlined are related 
to the numbers in the text. 

Figure 9: Structures of the two diasteriomeric forms of PLIacMc and PL2Ac,Me. The chirality in 
the phosphate group resulted from the transformation of PLIac and PL2ac to their methyl 
phosphates. 

Figure 10. In vitro determination of acceptor specificities of bacterial processive 
glycosyltransferase from S, aureus. 

Membrane fractions of E, coli BL21 (DE3) ^¥.say2A were used for in vitro enzyme assays 
with different labeled substrates as described in the experimental section. The lipophilic 
reaction products were subjected to thin-layer chromatography with subsequent radioscanning 
or fluorescence-detection for NBD-labelled products. 

A) Enzyme assays with NBD-ceramide (NBD-Cer) as acceptor 

lane \ \ E. coli BL21 (DE3) control 

lane2-4: independent £*. coli BL21 (DE3) pE^ay24 
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lane 5: NBD-ceramide-standard 

The product of highest polarity present in lane 1-4 is a degradation product of NBD-ceramide. 

B) Enzyme assays with radiolabeled sugar donors or different radiolabeled lipophilic 
acceptors and cell-free extracts of E. coli BL21 (DE3) pE5'^/y24(lane 1-7). 

lane 1-3: +[*'^C]cholesterol 

lane 4-6: +[^'^C]cholesterolglucoside 

lane 7: +UDP-[*'^C]glucose 

lane 8: standard, [^'^CJcholesterolglucoside 

lane 9: standard [^"^CJcholesterol 

The labelled product with higher polarity in lane 10 was also present in E, coli cells 
transformed with pUC18 and is therefore not resulting from processive glycosyltransferase 
activity. The structural assignments are tentative and based on chromatographic behaviour. 

Figure 1 1 . In vitro demonstration of glucosyltransferase processivity of the 
glycosyltransferases from subtilis and S, aureus expressed in E, coli BL21 (DE). Cell 
extracts of B, subtilis and E. coli BL21 (DE3) expressing processive glycosyltransferase from 
B. subtilis and S. aureus were used for in vitro enzyme assays with UDP-[^'*C]-glucose. 
Internal DAG served as sugar acceptor. The lipophilic reaction products were subjected to 
thin-layer chromatography with subsequent radioscanning. The products were identified by 
co-chromatography with unlabeled standards. 

Figure 12. In vitro demonstration of glucosyltransferase processivity of the 
glycosyltransferases from B. subtilis and S. aureus expressed in E. coli BL21 (DE). 
Membrane fractions ofE. coli BL21 (DE3) expressing processive glycosyltransferase from B. 
subtilis and S. aureus were used for in vitro enzyme assays with imlabeled UDP-glucose and 
different radiolabeled acceptors as described in the experimental section. The lipophilic 
reaction products were subjected to thin-layer chromatography with subsequent 
radioscanning. The products were identified by co-chromatography with unlabeled standards. 
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